Rescaled range and transition matrix analysis of DNA sequences
نویسندگان
چکیده
In this paper we treat some fractal and statistical features of the DNA sequences. First, a fractal record model of DNA sequence is proposed by mapping DNA sequences to integer sequences, followed by R/S analysis of the model and computation of the Hurst exponents. Second, we consider transition between the four kinds of bases within DNA. The transition matrix analysis of DNA sequences shows that some measures of complexity based on transition proportion matrix are of interest. The main results are: 1) Hexon > Hintron for virus. But Hintron > Hexon for the species which have the shape of cell except for drosophila. 2) For Virus, E. coli, yeast, drosophila, mouse and human, measures H of transition proportion matrix of exon is larger than that of intron, and measures λ, D, C, D̃ and C̃ of transition proportion matrix of intron are larger than that of exon. 3) Regarding the evolution, we find that when the species goes higher in grade, the measures D, C, D̃ and C̃ of exon become larger, the measure H of exon becomes lesser except for yeast. Hence for species of higher grade, the transition rate among the four kinds of bases goes further from the equilibrium.
منابع مشابه
Evaluation of First and Second Markov Chains Sensitivity and Specificity as Statistical Approach for Prediction of Sequences of Genes in Virus Double Strand DNA Genomes
Growing amount of information on biological sequences has made application of statistical approaches necessary for modeling and estimation of their functions. In this paper, sensitivity and specificity of the first and second Markov chains for prediction of genes was evaluated using the complete double stranded DNA virus. There were two approaches for prediction of each Markov Model parameter,...
متن کاملA comparative phylogenetic analysis of Theileria spp. by using two two "18S ribosomal RNA" and "Theileria annulata merozoite surface antigen" gene sequences
More than 185 species, strains and unclassified Theileria parasites are categorized in the Entrez Taxonomy. The accurate diagnosis and proper identification of the causative agents are important for understanding the epidemiology, prevention and appropriate treatment. This study aims to discuss the importance of two genes of Theileria annulata 18S ribosomal RNA (18S rRNA) and Theileria annulata...
متن کاملAn Evolutionary and Phylogenetic Study of the BMP15 Gene
DNA sequence data contains a wealth of biologically useful information. Recent innovations in DNA sequencing technology have greatly increased our capacity to determine massive amounts of nucleotide sequences. These sequences can be used to specify the characteristics of different regions, interpret the evolutionary relationships between categorized groups, likelihood of performing multiple com...
متن کاملThe Investigation of Mutations and Comparison of Leptin Gene Pro-Motor in Najdi Cattle with the Database NCBI Sequences
Objective: Identity the genetic aspects and major gene influence on energy balance, milk production, fertility, food safety and consumer are the recent interests of genetic and breeding researchers. Methods: Najdi Cattle is the most prominent breeds in Khuzestan province. To do this plan in Shoushtar Najdi Cattle Station, blood samples were taken from 15 Najdi Cattles. DNA was extracted from wh...
متن کاملOn the Spaces of $lambda _{r}$-almost Convergent and $lambda _{r}$-almost Bounded Sequences
The aim of the present work is to introduce the concept of $lambda _{r}$-almost convergence of sequences. We define the spaces $fleft( lambda _{r}right) $ and $f_{0}left( lambda _{r}right) $ of $ lambda _{r}$-almost convergent and $lambda _{r}$-almost null sequences. We investigate some inclusion relations concerning those spaces with examples and we determine the $beta $- and $gamma $-duals of...
متن کامل